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MICHAEL LACEY 

Abstract. Hankel operators lie at the junction of analytic and real-variables. We will explore this 
junction, from the point of view of Haar shifts and commutators. 



1 . Haar Functions 

We consider operators which satisfy invariance properties with respect to two well-known groups. 
The first group we take to the translation operators 

(1.1) Tv y f(x):=f(x-y), yeR. 

Note that formally, the adjoint operator is (Tr v )* = Tr_ y . The collection of operators {Tr y : y e R} 
is a representation of the additive group (R, +). 

It is an important, and very general principle that a linear operator L acting on some vector 
space of functions, which is assumed to commute with all translation operators, is in fact given as 
convolution, in general with respect to a measure or distribution, thus, 

L/(x) = j f(x-y) n(dy). 

For instance, with the identity operator, fi would be the Dirac pointmass at the origin. 
The second group is the set of dilations on LP , given by 

(1.2) Dilf f(x) := A~ 1/p f(x/A) , < A,p < oo . 

Here, we make the definition so that = HDif^ f\\ p . The scale of the dilation DiLr is said to be 
A, and these operators are a representation of the multiplicative group (R + , *). The Haar measure 
of of this group is dy/y. 

Underlying this subject are the delicate interplay between local averages and differences. Some 
of this interplay can be encoded into the combinatorics of grids, especially the dyadic grid, defined 
tobeD:={2 k (j,j+l) : j,keZ}. 

The Haar functions are a remarkable class of functions indexed by the dyadic grid T>. Set 

h{x) = -l(_i/2,0) + 1(0,1/2) , 
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Figure 1. Two Haar functions. 

a mean zero function supported on the interval (-1/2, 1 /2), taking two values, with L 2 norm equal 
to one. Define the Haar function (associated to interval T) to be 

(1.3) hi:=m\)hi 

( 1 .4) Dikf : = Tr c(7) Dil , c(I) = center of /. 

Here, we introduce the notion for the Dilation associated with interval I. 

The Haar functions have profound properties, due to their connection to both analytical and 
probabilistic properties. An elemental property is that they form a basis for L 2 (R). 

1.5. Theorem. The set of functions {l[o,i]} U {/?/ : I e D, I c [0, \]}form an orthonormal basis for 
L 2 ([0, 1]). The set of functions [hj : I £ D}form an orthonormal basis for L 2 (R). 

2. Paraproducts 

Products, and certain kind of renormalized products are common objects. Let us explain the 
renormalized products in a very simple situation. We begin with the definition of a paraproduct, as 
a bilinear operator. Define 

(2.1) h] = h I , ft} = |^|=Dil?l t _ 1A1/ 2,. 

The superscript indicates a mean-zero function, while the superscript 1 indicates a non-zero inte- 
gral. Now define 



(2.2) P ei '^C/i,/ 2 ) := J" -^i=rif2,hphf , ej e {0, 1}. 



For the most part, we consider cases where there is one choice of ej which is equal to one, but in 
considering fractional integrals, one considers examples where all e, are equal to one. The triple 
(ei, Ei, 63) is the signature of the Paraproduct. 

We have chosen this definition for specificity, but at the same time, it must be stressed that 
there is no canonical definition, and the presentation of a paraproduct can differ in a number of 
ways. Whatever the presentation, its single most important attribute is its signature. Indeed, in 
Proposition [531 we will see that a paraproduct arises from a computation that, while not of the form 
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above, is clearly an operator of signature (0,0,0). All the important prior work on commutators, 
see [Hll2l[6lHI!l can be interpreted in this notation. (The Lectures of M. Christ [5] are recommended 
as a guide to this literature.) For instance, in the notation of Coifman and Meyer jMZL a P t denotes 
a l , while a Q t denotes a °. 

Why the name paraproduct? This is probably best explained by the identity 

(2.3) /i • f 2 = P 1M (fufi) + P 0A1 C/i,/2) + P 0,1 '°C/i,/2) • 

Thus, a product of two functions is a sum of three paraproducts. The three individual paraproducts 
in many respects behave like products, for instance we will see that there is a Holder Inequality. 
And, very importantly, in certain instances they are better than a product. 
To verify (|2.3I) . let us first make the self-evident observation that 

(2.4) 1 f g(y) dy = = V (g, hj)hj{I) , 

where hj(J) is the (unique) value h j takes on /. In (12.31) . expand both fi and f 2 in the Haar basis, 

Split the resulting product into three sums, (1) / = J, (2) / c J (3) J c /. In the first case, 

2 (fuh I )(f 2 , hjXh,) 2 = P 0A1 (/i, / 2 ) • 

/,/ : I=J 

In the second case, use (12.41) . 

X (fuhdifMhj ■ 1 f hj(y) dy = Yifuh^-^^hj 
iTia J ' i Wl 

= P° J ^(/l,/2). 

And the third case is as in the second case, with the role of f\ and f 2 switched. 

A rudimentary property is that Paraproducts should respect Holder's inequality, a matter that we 
turn to next. This Theorem is due to Coifman and Meyer (6113. Also see [fT4l[T7l[T8ll . 

2.5. Theorem. Suppose at most one ofe\,e 2 , 63 are equal to one. We have the inequalities 

(2.6) ||P £l ' £2 ' £3 (/i,/ 2 )ll, < II/1IUII/2IU , 1 < Px,Pi < 00, 1/q = Hp, + l/p 2 . 

3. Paraproducts and Carleson Embedding 

We have indicated that Paraproducts are better than products in one way. These fundamental 
inequalities are the subject of this section. Let us define the notion of (dyadic) Bounded Mean 
Oscillation, BMO for short, by 

_ nl/2 

(3.1) II/IIbmo = sup 



\JV l J](f,h R y 



IcJ 
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3.2. Theorem. Suppose that at exactly one of €2 and 63 are equal to 1. 

(3.3) ||P°' £2 ' £3 (/i, -)||^ - II/iIIbmo , 1< P < 00 . 
Indeed, we have 

(3.4) ||P ai 'Vi, 0|| w - sup||P°^Vi, l-/r 1/p l/)|| p - II/iIIbmo • 

Here, we are treating the paraproduct as a linear operator on f 2 , and showing that the operator 
norm is characterized by H/iIIbmo- Obviously, ||/||bmo ^ 2H/IU, and again this a crucial point, there 
are unbounded functions with bounded mean oscillation, with the canonical example being In x. 
Thus, these paraproducts are, in a specific sense, better than pointwise products of functions. 

Proof. The case p = 2 is essential, and the only case considered in these notes. This particular case 
is frequently referred to as Carleson Embedding, a term that arises from the original application of 
the principal in the Corona Theorem. 

Let us discuss the case of P 0,1,0 in detail. Note that the dual of the operator 

/ 2 — »pW^(/ 1 ,/ 2 ), 

that is we keep f\ fixed, is the operator P ' 0,1 (/i, ■)> so it is enough to consider p 010 in the L 2 case. 



One direction of the inequalities is as follows. 

l J)\\p 



HP°' e2 ' e3 (/l,-)ll2^2 > SU P ||P°^^(/l,^^ 



^ II/iIIbmo 

as is easy to see from inspection. Thus, the BMO lower bound on the operator norm arises solely 
from testing against normalized indicator sets. 

For the reverse inequality, we compare to the Maximal Function. Fix fufi, and let 

Dk = {IeD : !%ML 2 *} 

wl 

Let T>\ be the maximal intervals in D k . The L 2 -bound for the Maximal Function gives us 



(3.5) ^2 2 ^|r|<||M/ 2 ||2< 

k 

Then, for I* e D* k we have 



|^</iA>2% [ = 2 2 ^</i,/* 7 > 2 

Id* Id* 

<2 2k \\Mluo\r\ 



And so we are done by (13.51) . 



□ 
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Figure 2. A Haar function hi and its dual gi. 



4. Hilbert Transform 

It is a useful Theorem, one that we shall return to later, that the set of operators L that are bounded 
from L 2 (R.) to itself, and commute with both translations and dilations have a special form. They 
are linear combinations of the Identity operator, and the Hilbert transform. The latter operator, 
fundamental to this study, is given by 

dy 



(4.1) 



a fix) 



:=p.v.J 



f(*-y) 



y 



Here, we take the integral in the principal value sense, as the kernel 1 /y is not integrable. Taking 
advantage of the fact that the kernel is odd, one can see that the limit below 



(4.2) 



lim f 

Je<y|<l/e 



ft \ dy 

f(x - y) — 

y 



exists for all x, provided / is a Schwartz function, say. Thus, H has an unambiguous definition 
on a dense class of functions, in all U . We shall take (14.21) as our general definition of principal 
value. The Hilbert transform is the canonical example of a singular integral, that is one that has to 
be defined in some principal value sense. 

Observe that H, being convolution commutes with all translations. That is also commutes with 
all dilation operators follows from the observation that 1 jy is a multiple of the multiplicative Haar 
measure. It can also be recovered in a remarkably transparent way from a simple to define operator 
based upon the Haar functions. Let us define 

(4.3) g = -l(_i/4_i/4) + l(_i/4 > i/4) - 1(1/4,1/2) 

(4.4) =2- 1/2 {ftf_,/ 2 . m + fc fl 



(4.5) 



"{fy-1/2,0) 

$>f = Yj(f,hi)gi, 



'(0,1/2)) 



where as before, gi = Dif} 2) g. It is clear that § is a bounded operator on L 2 . What is surprising is 
that that it can be used to recover the Hilbert transform exactly. The succinct motivation for this 
definition is that H(sin) = cos, so that if hi is a local sine, then gi is a local cosine. 

4.6. Theorem (S. Petermichl [ 20 ]). There is a non-zero constant c so that 



(4.7) 



H = c lim 



Jo \\ 



Tr.Dilf §Dil-Tr_ y ^ ^ 



;i(2) 



dX dy 
TT 
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As a Corollary, we have the estimate \\H\\ 2 < 1, as § is clearly bounded on L 2 . 

The operator t) is referred to as a Haar shift or as a dyadic shift ( Il22l0 . Certain canonical singular 
integrals, like the Hilbert, Riesz and Beurling transform admit remarkably simple Haar shift vari- 
ants, which fact can be used to prove a range of deep results. See for instance [8l|2Tl|23l|24]|. For 
applications of this notion to more general singular integrals, see lfT3l Section 4]. 

Proof. Consider the limit on the right in (|4.7I) . This is seen to exist for each x e R for Schwartz 
functions /. While this is elementary, it might be useful for us to define the auxiliary operators 



Tjf-=Yj(f,hj)gj 



\I\<V 

The individual terms of this series are rapidly convergent. As |/| becomes small, one uses the 
smoothness of the function /. As |/| becomes large, one uses the fact that / is integrable, and 
decays rapidly. Call the limit H/. 

Let us also note that the operator Tj is invariant under translations by an integer multiple of 2K 
Thus, the auxiliary operator 

2~ j f Tr„,/Tr, dt 
Jo 

will be translation invariant. Thus H is convolution with respect to a linear functional on Schwartz 
functions, namely a distribution. 

Concerning dilations, T is invariant under dilations by a power of 2. Now, dilations form a group 
under multiplication on R + , and this group has Haar measure d6/6 so that the operator below will 
commute with all dilations. 

Dil^TDil^ 

Thus, H commutes with all dilations. 

Therefore, H must be a linear combination of a Dirac delta function and convolution with 1 /y. 
(The function l/\y\ is also invariant under dilations, but the inner product with this function is not 
a linear functional on distributions.) Applying H to a non negative Schwartz function yields a 
function with zero mean. Thus, H must be a multiple of convolution with 1 /y, and we only need to 
see that it is non zero multiple. 

Let us set Gj to be the operator 



Gj f := Tran, ^ (Tran_ ; /, h^hj — . 

JO r cri Z 



IE® 
\I\=2' 

This operator translates with translation and hence is convolution. We can write G 7 f = Jj* f. By 
the dilation invariance of the Haar functions, we will have y 7 = Dil^ Jq. A short calculation shows 
that 

yoGO =| h 1 (y + O/J/OO dt 
Jo 
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Figure 3. The graph of y an d 

This function is depicted in Figure [3] Certainly the operator £ 7 Gj is convolution with Yuj 7j( x )- 
This kernel is odd and is strictly positive on [0, oo). This finishes our proof. 

□ 

5 . Commutator Bound 

We would like to explain a classical result on commutators. 

5.1. Theorem. For a function b, and 1 < p < oo we have the equivalence 

IP,H]||^ p -||£||bmo, 
where this is the non-dyadic BMO given by 



sup 

/ interval 



I/I"' 



IK'I 



f(y)dy 



dx 



1/2 



We refer to this as a classical result, as it can be derived from the Nehari theorem, as we will 
explain below. The lower bound on the operator norm is found by applying the commutator to 
normalized indicators of integrals, and we suppress the proof. 

Both bounds are very easy, if one appeals to the Nehari Theorem. See our comments on Ne- 
hari's Theorem below. But, in many circumstances, different proofs admit different modifications, 
and so we present a 'real-variable' proof, deriving the upper bound from the Haar shift, and the 
Paraproduct bound in a transparent way. 

Replacing the Hilbert transform by the Haar Shift, we prove 

(5.2) \\[b,m p ^ P <\\b\\BMO 

The last norm is dyadic-BMO, which is strictly smaller than non-dyadic BMO. But Theorem 14.61 
requires that we use all translates and dilates to recover the Hilbert transform, and so the non-dyadic 
BMO norm will be invariant under these translations and dilations. 

The Proposition is that [b, can be explicitly computed as a sum of Paraproducts which are 
bounded. 
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5.3. Proposition. We have 

(5.4) [b, W = P ox \b, §>f) - § o pM.°(i, /) 

(5.5) +P iW (b,<bf)-$>opW> l (b,f) 

—0,0,0 

(5.6) +P (b,f). 

—0,0,0 

In the last line, P (b, f) is defined to be 

—0,0,0 Y-i (b,h°j) a n n 

/€£) 

Each of the five terms on the right are LP -bounded operators on /, provided b e BMO, so that the 
upper bound on the commutator norm in Theorem l5.1l follows as an easy corollary. The paraproduct 
in (15.61) does not hew to our narrow definition of a Paraproduct, but it is degenerate in that it is of 
signature (0, 0, 0), and thus even easier to control than the other terms. 

Proof. Now, [b, §]/ = b&f - 9)(b ■ f). Apply (|2.3I) to both of these products. We see that 

[b,W= Yj p'(W)-£pW)- 

?=(1,0,0),(0,1,0),(0,0,1) 



The choices of e = (0, 1,0), (0, 0, 1) lead to the first four terms on the right in (15.41) . 

The terms that require more care are the difference of the two terms in which a 1 falls on a b. In 
fact, we will have 

V\b,$>f)-h?\bJ) = V fl \bJ). 
To analyze this difference quickly, let us write 

(§/, hi) = sgn(/)</, kstow) 

where Par(7) is the 'parent' of /, and sgn(7) = 1 if / is the left-half of Par(7), and is otherwise -1. 
This definition follows immediately from the definition of gi in (14.31) . Now observe that 

(¥ g (b,m,h° I y = w,'p g (b,h o I )) 



(b,h\) 



W,h°j) 



Vi/f 

(b,h)) 



<MrWsgn(/)- 



And on the other hand, we have 



(^°(b,f),hj) = ( ^^^ sgn(/X/A(/)> 

Comparing these two terms, we see that we should examine the term that falls on b. But a calcula- 
tion shows that 

V2^-/iL(/) = -sgn(/)C(/)- 
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6. The Nehari Theorem 
We define Hankel operators on the real line. On L 2 (K.), we have the Fourier transform 

7(0 = J f(x)e-* x dx. 
Define the orthogonal projections onto positive and negative frequencies 

P*/(*)= f dx. 

Define Hardy spaces H 2 (W) = f P + L 2 (R). Functions / 6 H 2 (W) admit an analytic extension to the 
upper half plane C+. As in the case of the disk, it is convenient to refer to functions in H 2 (H) as 
analytic. 

A Hankel operator with symbol b is then a linear operator from H 2 (C+) to if 2 (C + ) given by 

def 

H fo <p = P + Mb if. This only depends on the analytic part of b. It is typical to include the notation 
C + to emphasize the connection with analytic function theory, and the relevant domain upon which 
one is working. Below, we will suppress this notation. 
The result that we are interested in is: 

6.1. Nehari's Theorem ( lfT9"1 ). The Hankel operator H b is bounded from H 2 to H 2 iff there is a 
bounded function ft with P+b = P+fi. Moreover, 

(6.2) ||H,||= inf PL 

/*:P +y S=P+i> 

Less exactly, we have \\Hb\\ - ||P+ ^Hbmo, where we can take the last norm to be non-dyadic BMO. 

This Theorem was proved in 1954, appealing to the following classical fact. 

6.3. Proposition. Each function f e H l is a product of functions fi,fz 6 H 2 . In particular, f\ and 
fi can be chosen so that 

11/11//. = H/ilMI/ 2 ll// 2 

Given a bounded Hankel operator H&, we want to show that we can construct a bounded function 
/3 so that the analytic part of b and ft agree. 
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This proof is the one found by Nehari [fT9ll . We begin with a basic computation of the norm of 
the Hankel operator H h : 

\\Hb\\ = sup sup I H/,(/r -lp dx 

IMI^i M\ H 2=i J 

= sup sup I P + M fo (/r • Tp dx 



(6.4) 



=1 



p J (p + < 



= sup sup (P + b)\f/ ■ <p dx 



= sup sup ((P + b),ifr-<p) 

\\<P\\ H 2=1 M\ H 2 = 1 

But, the H l = H 2 ■ H 2 , as we recalled in Proposition 16.31 We read from the equality above that the 
analytic part of b defines a bounded linear functional on H l a subspace of L 1 . 

The Hahn Banach Theorem applies, giving us an extension of this linear functional to all of L 1 , 
with the same norm. But a linear function on L 1 is a bounded function, hence we have constructed 
a bounded function fi with the same analytic part as b. 

The calculation (|6.4I) is more general than what we have indicated here, a point that we return to 
below. 

Let us remark that the H p variant of Nehari's Theorem holds. On the one hand, one has H p -H p c 
H 1 , so that the upper bound on the norm ||H fo ||//p^ ffP follows. On the other, Proposition [63] extends 
to the H p -H p factorization, whence the same argument for the lower bound can be used. 

There is a close connection between commutators [b, H] and Hankel operators. Indeed, we have 
(6.5) [Z?,H] = [b,H] = 2P„&P + -2P + £P . 

The two terms on the right can be recognized as two Hankel operators with orthogonal domains 
and ranges. Indeed, keep in mind the elementary identities P^ = P + , P + P- = 0, H = I-2P_, and 
[b, I] = 0. Then, observe 

P + [6,H]P- = -2P + [6,P_]P_ 

= -P + bP 2 +P + P„6P_ = -P + £P_ 
P_[fc,H]P- =P_[fc,P + ]P_ = 
There are two additional calculations, which are dual to these and we omit them. 

7. Further Applications 

The author came to the Haar shift approach to the commutator from studies of Multi-Parameter 
Nehari Theorem H10U16II . The paper [15] surveys these two papers. This subject requires an under- 
standing of the structure of product BMO that goes beyond the foundational papers of S.-Y. Chang 
and R. Fefferman [3l|4]] on the subject. 

In particular, as in Nehari's Theorem, the upper bound on the Hankel operator is trivial, as one 
direction of the factorization result is trivial: H 2 ■ H 2 c H l . The lower bound is however very far 
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from trivial, as factorization is known to fail in product Hardy spaces. Indeed, Nehari's theorem 
is equivalent to so-called weak factorization, one of the points of interest in the Theorem. See 
H10[|15[[T6l for a discussion of this important obstruction to the proof, and relevant references. 

There are different critical ingredients needed for the proof of the lower bound. One of them 
is a very precise quantitative understanding of the proof of the upper bound. It is at this point 
that the techniques indicated in this paper are essential. The fundamentals of the multi-parameter 
Paraproduct theory were developed by Journe lfTTl[12l . The subject has been revisited recently to 
develop novel Leibnitz rules by Muscalu, Pipher, Tao and Thiele H17N18L Also see Ifl4ll . 

An influential extension of the classical Nehari Theorem to a real-variable setting was found by 
Coifman, Rochberg and Weiss Q: Real-valued BMO on R" can be characterized in terms of com- 
mutators with Riesz Transforms. The real-variable setting implies a complete loss of analyticity, 
making neither bound easy. Recently, the author, with Pipher, Petermichl and Wick, have proved 
the multi-parameter extension of the this result [13]. This paper includes in it a quantification of 
the Proposition l5.3l to the higher dimensional setting, for (smooth) Calderon Zygmund operators T: 
[b, T] is a sum of bounded paraproducts, a crucial Lemma in that paper. See lfl"3l Proposition 5.11]. 
Such an observation is not new, as it can be found in e. g. [0Q for instance. Still the presentation of 
Proposition l5.3l in this paper is as simple as any the author is aware of in the literature. 
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